ETSI TS 126 104 V3.2.1 {2001 06) 

Technical Specification 



Universal Mobile Telecommunications System (UMTS); 

ANSI-C code for the floating-point AMR speech codec 

(3GPP TS 26.104 version 3.2.1 Release 1999) 



3!^^ 






y 



3GPP TS 26.104 version 3.2.1 Release 1999 1 ETSI TS 126 104 V3.2.1 (2001-06) 



Reference 



RTS/TSGS-04261 04UR3 
Keywords 



UMTS 



ETSI 

650 Route des Lucioles 
F-06921 Sophia Antipolis Cedex - FRANCE 

Tel.: +33 4 92 94 42 00 Fax: +33 4 93 65 47 16 

Siret N°348 623 562 00017 - NAF 742 C 
Association a but non lucratif enregistree a la 
Sous-Prefecture de Grasse (06) N° 7803/88 



Important notice 



Individual copies of the present document can be downloaded from: 
http://www.etsi.orq 

The present document may be made available in more than one electronic version or in print. In any case of existing or 

perceived difference in contents between such versions, the reference version is the Portable Document Format (PDF). 

In case of dispute, the reference shall be the printing on ETSI printers of the PDF version kept on a specific network drive 

within ETSI Secretariat. 

Users of the present document should be aware that the document may be subject to revision or change of status. 
Information on the current status of this and other ETSI documents is available at http://www. etsi . o rq/tb/status/ 

If you find errors in the present document, send your comment to: 
editor@etsi.fr 

Copyright Notification 

No part may be reproduced except as authorized by written permission. 
The copyright and the foregoing restriction extend to reproduction in all media. 

© European Telecommunications Standards Institute 2001. 
All rights reserved. 



£75/ 



3GPP TS 26.104 version 3.2.1 Release 1999 2 ETSI TS 126 104 V3.2.1 (2001-06) 



Intellectual Property Rights 



IPRs essential or potentially essential to the present document may have been declared to ETSI. The information 
pertaining to these essential IPRs, if any, is publicly available for ETSI members and non-members, and can be found 
in ETSI SR 000 314: "Intellectual Property Rights (IPRs); Essential, or potentially Essential, IPRs notified to ETSI in 
respect of ETSI standards", which is available from the ETSI Secretariat. Latest updates are available on the ETSI Web 
server (http://www.etsi.org/ipr ). 

Pursuant to the ETSI IPR Policy, no investigation, including IPR searches, has been carried out by ETSI. No guarantee 
can be given as to the existence of other IPRs not referenced in ETSI SR 000 314 (or the updates on the ETSI Web 
server) which are, or may be, or may become, essential to the present document. 



Foreword 

This Technical Specification (TS) has been produced by the ETSI 3' Generation Partnership Project (3GPP). 

The present document may refer to technical specifications or reports using their 3GPP identities, UMTS identities or 
GSM identities. These should be interpreted as being references to the corresponding ETSI deliverables. 

The cross reference between GSM, UMTS, 3GPP and ETSI identities can be found under www.etsi.org/kev . 
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Foreword 



rd , 



This Technical Specification (TS) has been produced by the 3' Generation Partnership Project (3GPP). 

The contents of the present document are subject to continuing work within the TSG and may change following formal 
TSG approval. Should the TSG modify the contents of the present document, it will be re-released by the TSG with an 
identifying change of release date and an increase in version number as follows: 

Version x.y.z 

where: 

X the first digit: 

1 presented to TSG for information; 

2 presented to TSG for approval; 

3 or greater indicates TSG approved document under change control. 

y the second digit is incremented for all changes of substance, i.e. technical enhancements, corrections, 
updates, etc. 

z the third digit is incremented when editorial only changes have been incorporated in the document. 



£75/ 



3GPP TS 26.104 version 3.2.1 Release 1999 5 ETSI TS 126 104 V 3.2.1 (2001-06) 



1 Scope 



This Technical Standard (TS) contains an electronic copy of the ANSI-C code for a floating-point implementation of 
the Adaptive Multi-Rate codec. This floating-point codec specification is mainly targeted to be used in multimedia 
applications such as the 3G-324M terminal specified in 3GPP TS 26. 110, or in packet-based (e.g., H.323) applications. 
The bit-exact fixed-point ANSI-C code in 3GPP TS 26.073 remains the preferred implementation for all applications, 
but the floating-point codec may be used instead of the fixed-point codec when the implementation platform is better 
suited for a floating-point implementation. It has been verified that the fixed-point and floating-point codecs 
interoperate with each other without any artifacts. 

The floating-point ANSI-C code in this specification is the only standard conforming non-bit-exact implementation of 
the Adaptive Multi Rate speech transcoder (3GPP TS 26.090 [2]), Voice Activity Detection (3GPP TS 26.094 [6]), 
comfort noise generation (3GPP TS 26.092 [4]), and source controlled rate operation (3GPP TS 26.093 [5]). The 
floating-point code also contains example solutions for substituting and muting of lost frames (3GPP TS 26.091 [3]). 

The fixed-point specification in 26.073 shall remain the only allowed implementation for the 3G mandatory 
speech service and the use of the floating-point codec is strictly limited to other services. 

The floating-point encoder in this specification is a non-bit-exact implementation of the fixed-point encoder producing 
quality indistinguishable from that of the the fixed-point encoder. The decoder in this specification is functionally a 
bit-exact implementation of the fixed-point decoder, but the code has been optimized for speed and the standard fixed- 
point libraries are not used as such. 



Normative references 



The following documents contain provisions which, through reference in this text, constitute provisions of the present 
document. 

• References are either specific (identified by date of publication, edition number, version number, etc.) or 
non-specific. 

• For a specific reference, subsequent revisions do not apply. 

• For a non-specific reference, the latest version applies. In the case of a reference to a 3GPP document (including 
a GSM document), a non-specific reference implicitly refers to the latest version of that document in the same 
Release as the present document. 

[1] 3GPP TS 26.074: "AMR Speech Codec; Test sequences". 

[2] 3GPP TS 26.090: "AMR Speech Codec; Speech transcoding". 

[3] 3GPP TS 26.091: "AMR Speech Codec; Substitution and muting of lost frames". 

[4] 3GPP TS 26.092: "AMR Speech Codec; Comfort noise aspects". 

[5] 3GPP TS 26.093: "AMR Speech Codec; Source controlled rate operation". 

[6] 3GPP TS 26.094: "AMR Speech Codec; Voice Activity Detection". 

[7] 3GPP TS 26.073: "ANSI-C code for the Adaptive Multi Rate speech codec". 

[8] 3GPP TS 26.101: "AMR Speech Codec Frame Structure". 
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3 Definitions and abbreviations 

3.1 Definitions 

Definition of terms used in the present document, can be found in 3GPP TS 26.090 [2], 3GPP TS 26.091 [3], 3GPP 
TS 26.092 [4], 3GPP TS 26.093 [5], and 3GPP TS 26.094 [6]. 

3.2 Abbreviations 

For the purpose of the present document, the following abbreviations apply: 

ANSI American National Standards Institute 

ETS European Telecommunication Standard 

GSM Global System for Mobile communications 

I/O Input/Output 

RAM Random Access Memory 

ROM Read Only Memory 



C code structure 



This clause gives an overview of the structure of the floating-point C code and provides an overview of the contents and 
organization of the C code attached to this document. The basic structure of the floating-point C code follows that of the 
bit-exact fixed-point code [7]. 

The C code has been verified on the following systems: 

IBM PC/AT compatible computers with Windows NT40 and Microsoft Visual C-n- v.5.0 compiler; 

HP workstations and GNU gcc compiler; 

IBM PC/AT compatible computers with Linux operating system and GNU gcc compiler; 
ANSI-C 9899 was selected as the programming language because portability was desirable 

4.1 Contents of the C source code 

The C code distribution has all files in the root level. 

The files with suffix "c" contain the source code and the files with suffix "h" are the header files. The ROM data is 
contained in "rom" files with suffix "h". 

The C code does not contain any speech coder installation verification data files. Verification for the bit-exact decoder 
is defined in specification 3GPP TS 26.073 [7]. 

Makefiles are provided for the platforms in which the C code has been verified (listed above). Once the software is 
installed, this directory will have a compiled version of encoder and decoder and all the object files. 
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4.2 Program execution 

The Adaptive Multi-Rate codec is implemented in two programs: 

(encoder) speech encoder; 

(decoder) speech decoder. 

The programs should be called like: 

encoder [-dtx] mode speech_file bitstream_file 

or 

encoder [-dtx] -modefile=mode_file speech_file bitstream_file 

decoder <parameter file> <speech output file> 

The speech files contain 16-bit linear encoded PCM speech samples and the parameter files contain encoded speech 
data and some additional flags. 

See the file readme.txt for more information on how to run the encoder and decoder programs. 



4.3 Coding style 



The C code has been written according to structuring conventions used in 3GPP TS 26.073 [7]. Encoder and decoder 
state structures are allocated and initialized with special initializing functions. There are no separate functions for each 
module, as opposed to the fixed-point implementation in 3GPP TS 26.073 [7]. 

4.4 Code hierarchy 

The code hierarchy follows the one specified in 3GPP TS 26.073 [7]. 

Figures 1 to 4 are call graphs that show the functions used in the speech codec, including the functions of VAD, DTX, 
and comfort noise generation. 

Each column represents a call level and each cell a function. The functions contain calls to the functions in rightwards 
neighboring cells. The time order in the call graphs is from the top downwards as the processing of a frame advances. 
All standard C functions, such as printf(), fwrite(), etc., have been omitted. 

The encoder call graph is broken down into three separate call graphs, shown in Tables 1 to 3. 
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Table 1 : Speech encoder call structure 



Speech_Encode__Frame 



Pre„Process 



tx_dtx_handler 



Ipc 



Isp 



dtx_buffer 



dtx_enc 



checkjsp 



pre_big 



oljtp 



vad_pitch_detection 



subframePreProc 



cljtp 



cbsearch 



gainQuant 



update_gp_clipping 



subframePostProc 



filter bank 



vad_decision 



first_filter_5tage 



filters 



level_calculation 



complex_estimate_adapt 



complex_vad 



noi5e_e5timate_update 



hangover_addition 



update_cntrl 



Autocorr 



Levinson 



Azjsp 



Q_plsf_5 



Int_lpc_1and3_2 



Int_lpc_1and3 



Q_plsf_3 



lnt_lpc_1to3_2 



lnt_lpc_1to3 



Dotproduct4Q 



Lspjsf 



Reorderjsf 



Lsfjsp 



Q_plsf_3 



Chebps 



Lspjsf 



Lsf_wt 



Vq_subvec 



Vq_subvec_s 



Reorderjsf 



Lsfjsp 



Lsp_az 



Lsp_az 



Lspjsf 



Lsf_wt 



Vq_subvec3 



Vq_subvec4 



Reorderjsf 



Lsfjsp 



Lsp_az 



Lsp_az 



GetJsp_pol 



GetJsp_pol 



GetJsp_pol 



GetJsp_pol 



Lspjsf 



Lsf_wt 



Vq_subvec3 



Vq_subvec4 



Reorderjsf 



Lsfjsp 



Weight_Ai 



Residu 



Synjilt 



Pitch_ol 



Pitch_ol_wgh 



vadJone_detection_update 



Lag_max 



comp_corr 



hp_max 



comp_corr 



Lag_max_wgfit 



gmed_n 



hp_max^ 



vad Jo n e_d etecti n 



vadJone_detection_update 



vad Jo n e_d etecti n 



Weight_Ai 



Synjilt 



Residu 



PredJt_3or6 



G_pitch 



check_gp_clipping 



q_gain_pitch 



see Table 2 



see Table 3 



Copy 



Synjilt 



getRange 



Norm_Corr 



searcfiFrac 



Encjag3 



EncJagS 



Dotproduct40 



lnterpol_3or6 



Dotproduct40 
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Table 2: cbsearch call structure 



cbsearch 


code_2i40_9bits 


cor h X 


Dotproduct40 


set sign 




cor h 


Dotproduct40 


search 2140 9bits 




build code 2i40 9bits 


code_2i40_11bits 


cor In X 


Dotproduct40 


set sign 




cor h 


Dotproduct40 


search 2140 11 bits 




build code 2i40 11 bits 


code_3i40_14bits 


cor h X 


Dotproduct40 


set sign 




cor h 


Dotproduct40 


search 3i40 




build code 3i40 14bits 


code_4i40_17bits 


cor h X 


Dotproduct40 


set sign 




cor h 


Dotproduct40 


search 4i40 




build code 4i40 




code_8i40_31bits 


cor h X 


Dotproduct40 


set sign12k2 


Dotproduct40 


cor h 


Dotproduct40 


search Bi40 




build code 8i40 31 bits 




compress code 


compresslO 


code_10i40_35bits 


cor h X 


Dotproduct40 


set sign12l<2 


Dotproduct40 


cor h 


Dotproduct40 


search 10i40 




build code 10i40 35bits 




q_P 





Table 3: gainQuant call structure 



gainQuant 


gc pred 


Dotproduct40 






calc fill energies 


Dotproduct40 




Dotproduct40 






MR475 update unq pred 




MR475 gain quant 


gc pred 


Dotproduct40 


q gain code 






MR795_gain_quant 


q gain pitch 




I\/1R795 gain code quantS 




calc unfilt energies 


Dotproduct40 


gain adapt 


Gmed n f 


I\/1R795 gain code quant mod 




Qua gain 
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Table 4: Speech decoder call structure 



Speech_Decode_Frame 


Decoder_amr 


rx dtx handler 








Decoder amr reset 






dtxdec 


Copy 






Lsf Isp 




D pisf 3 


Lsf Isp 




pseudonoise 






Lsp lsf 




Reorder lsf 




Lsp Az 


Get Isp pol 




A Refl 






Log2 


Log2 norm 




Pow2 






Build CN code 


pseudonoise 




Syn flit 






Lsf Isp 






Isp avg 




Build CN param 




D pisf 3 


Lsf Isp 




Int Ipc 1to3 


Lsp Az 


Get Isp pol 




D_plsL5 


Reorder lsf 






Lsf Isp 




Int Ipc lands 


Lsp Az 


Get Isp pol 




Dec Iag3 








Pred It 3or6 40 




Dec lags 




decode 2140 9bits 




decode 2140 11 bits 




decode 3140 14bits 




decode 4140 17bits 




decode 8i40 31 bits 


decompress codewords 


decompresslO 




ec gain pitcti 


gmed n 






d gain pitch 






ec gain pitch update 




decode 10i40 35bits 




Dec_gain 


Log2 


Log2 norm 




gc_pred 


Log2 


Log2 norm | 


Log2 norm 




Pow2 






gc pred update 




ec_gain_code 


gmed n 




gc pred average limited 




gc pred update 




ec gain code update 






d_gain_code 


gc_pred 


Log2 


Log2 norm | 


Log2 norm 




Pow2 






gc pred update 




Int lsf 






Cb gain average 




ph disp 




sqrt 1 exp 




Ex Ctrl 


gmed n 




agc2 


Inv sqrt 




Syn tilt 






Bgn scd 


gmed n 




dtx_dec_activity_update 


Copy 




Log2 


Log2 norm 




Isp avg 








Post_Filter 


Residu40 




Syn tilt 




age 


energy new 


energy old 




Inv sqrt 






Post Process 









4.5 



Variables, constants and tables 



The data types of variables and tables used in the floating-point implementation are signed integers in 2's complement 
representation, defined by: 

Words 8 bit variable 
UWordS 8 bit unsigned variable 

Wordl6 16 bit variable 
Word32 32 bit variable 

Floating-point numbers use the IEEE (Institute of Electrical and Electronics Engineers) format: 

Float32 8 bit exponent, 23 bit mantissa, 1 bit sign 

Float64 1 1 bit exponent, 52 bit mantissa, 1 bit sign 
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Furthermore some enum types are used, all possible to represent with one byte, and a boolean Flag. 

4.5.1 Description of constants used in \he C code 

Constants for the codec are defined in rom (h) files. 



4.5.2 Description of fixed tables used in tine C code 

This section contains a listing of all fixed tables sorted by source file name and table name. 

Table 5: Speech encoder fixed tables 



File 



Table name 



Type[Length] 



Description 



rom_enc.h 


tracklable 


Word8[4*5] 


rom_enc.h 


gammal 


Float32[10] 


rom_enc.h 


gamma1_12k2 


Float32[10] 


rom_enc.h 


gamma2 


Float32[10] 


rom_enc.h 


b60 


Float32]61] 


rom_enc.h 


startPosI 


Word16[2] 


rom_enc.h 


startPos2 


Word16[4] 


rom_enc.h 


startPos 


Word16[16] 


rom_enc.h 


corra/eight 


Float32[251] 


rom_enc.h 


qua_gain_pitch 


Float32[16] 


rom_enc.h 


qua_gain_pitch_MR1 2 

2 

qua_gain_code 


Float32[16] 


rom_enc.h 


Float32[64] 


rom_enc.h 


gray 


Word8[8] 


rom_enc.h 


grid 


Float32[61] 


rom_enc.h 


b24 


Fioat32[25] 


rom_enc.h 


lag_wind 


Float32[10] 


rom_enc.h 


lsp_init_data 


Float32[10] 


rom_enc.h 


past_rq_init 


Float32[80] 


rom_enc.h 


mean_lsf_3 


Float32[10] 


rom_enc.h 


mean_lsf_5 


Float32[10] 


rom_enc.h 


predjac 


Float32[10] 


rom_enc.h 


dico1_lsf_3 


Float32[3*256] 


rom_enc.h 


dico2_lsf_3 


Float32[3*512] 


rom_enc.h 


dico3_lsf_3 


Float32[4*512] 


rom_enc.h 


mr515_3_lsf 


Float32[4*128] 


rom_enc.h 


mr795_1_lsf 


Float32[3*512] 


rom_enc.h 


dico1_lsf_5 


Float32[4*128] 


rom_enc.h 


dico2_lsf_5 


Float32[4*256] 


rom_enc.h 


dico3_lsf_5 


Float32[4*256] 


rom_enc.h 


dico4_lsf_5 


Float32[4*256] 


rom_enc.h 


dico5_lsf_5 


Float32[4*64] 


rom_enc.h 


table_gain_MR475 


Float32[4*256] 


rom_enc.h 


table_gain_highrates 


Float32[128*3] 


rom_enc.h 


table_gain_lowrates 


Float32[64*3] 


rom_enc.h 


window_200_40 


Float32[240] 


rom_enc.h 


window_160_80 


Float32[240] 


rom_enc.h 


window_232_8 


Float32[240] 


rom_enc.h 


corra/eight 


Float32[251] 


rom_enc.h 


mode_dep_parm 


Word8[8*9] 



track table for algebraic code book search {MR475, MR515) 

spectral expansion factors 

spectral expansion factors 

spectral expansion factors 

interpolation filter coefficients 

track start search position for first pulse 

track start search position for second pulse 

track start search position 

weighting of the correlation function in open loop LTP search (MR102) 

adaptive codebook gain quantization table (MR795) 

adaptive codebook gain quantization table (MR122) 

fixed codebook gain quantization table (MR122, MR795) 

gray coding table 

grid points at wich Chebyshev polynomials are evaluated 

interpolation filter coefficients 

lag window table 

initialization table for Isp history in DTX 

initialization table for the MA predictor in DTX 

LSF means (not in MR122) 

LSF means (MR1 22) 

LSF prediction factors (not in MR122) 

1" LSF quantizer (not in MR122 and MR795) 

2"" LSF quantizer (not in MR122) 

3'" LSF quantizer (not in MR1 22, MR51 5 and MR475) 

3'" LSF quantizer (MR515 and MR475) 

1" LSF quantizer (IVIR795) 

1" LSF quantizer (IVIRI 22) 

2"" LSF quantizer (IVIRI 22) 

3'" LSF quantizer (MR1 22) 

4'" LSF quantizer (MR1 22) 

5'^ LSF quantizer (MR1 22) 

gain quantization table (MR475) 

gain quantization table (MR67, MR74 and MR102) 

gain quantization table (MR515 and MR59) 

LP analysis window (not in MR122) 

1" LP analysis window (l\/IR122) 

2"" LP analysis window (MR1 22) 

correlation weights 

parameters defining the adaptive codebook search per mode 
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Table 6: Speech decoder fixed tables 



File 



Table name 



Type[Length] 



Description 



rom_dec.h 
rom_dec.h 
ronn_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
ronn_dec.h 
rom_dec.h 
ronn_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
ronn_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
ronn_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
ronn_dec.h 
rom_dec.h 
rom_dec.h 
rom_dec.h 
rom dec.h 



dtx_log_en_adjust 

cdown 

pdown 

pred 

pred_MR122 

gamma3_MR122 

gammas 

gamma4_MR122 

gamma4 

bitno_MR475 

bitno_MR515 

bitno_MR59 

bitno_MR67 

bitno_MR74 

bitno_MR795 

bitno_MR102 

bitno_MR122 

bitno_MRDTX 

qua_gain_pitch 

qua_gain_code 

gray 

dgray 

sqrtjable 

inv_sqrt_table 

log2_table 

pow2_table 

cosjable 

acos_slope 

phj m p_low_M R795 

ph_imp_mid_MR795 

phjmpjow 

ph_imp_mid 

past_rq_init 

mean_lsf_3 

mean_lsf_5 

predjac 

dico1_lsf_3 

dico2_lsf_3 

dico3_lsf_3 

mr515_3_lsf 

mr795_1_lsf 

dico1_lsf_5 

dico2_lsf_5 

dico3_lsf_5 

dico4_lsf_5 

dico5_lsf_5 

table_gain_MR475 

table_gain_highrates 

table_gain_lowrates 

inter_6 

window_200_40 

table_speech_bad 

table_SID 

table_DTX 

table mute 



Word16[9] level adjustments for ech mode 
Word32[7] attenuation factors for codebook gain 
Word32[7] attenuation factors for adaptive codebook gain 
Word32[4] algebraic code book gain MA predictor coefficients 
Word32[4] algebraic code book gain MA predictor coefficients (MR1 22) 
Word32[10] spectral expansion factors 
Word32[10] spectral expansion factors 
Word32[10] spectral expansion factors 
Word32[10] spectral expansion factors 
Wordi 6[1 7] number of bits per parameter to transmit (MR475) 
Wordi 6[1 9] number of bits per parameter to transmit (MR51 5) 
Wordi 6[1 9] number of bits per parameter to transmit (MR59) 
Wordi 6[1 9] number of bits per parameter to transmit (MR67) 
Wordi 6[1 9] number of bits per parameter to transmit (MR74) 
Wordi 6[23] number of bits per parameter to transmit (MR795) 
Wordi 6[39] number of bits per parameter to transmit (MR1 02) 
Wordi 6[57] number of bits per parameter to transmit (MR1 22) 
Wordi 6[5] number of bits per parameter to transmit (MRDTX) 
Word32[1 6] adaptive codebook gain quantization table (MR1 22, MR795) 
Word32[96] fixed codebook gain quantization table (MR1 22, MR795) 
Word8[8] gray coding table 
Word8[8] gray decoding table 
Word32[49] table to compute sqrt(x) 
Word32[49] table used in inverse square root computation 
Word32[33] table used inbase 2 logharithm computation 
Word32[33] table used in 2 to the power computation 
Word32[65] table to compute cos{x) in Lsf_lsp{) 
Word32[64] table to compute acos{x) in Lsp_lsf() 
Word32[40] phase dispersion impulse response (MR795) 
Word32[40] phase dispersion impulse response (MR795) 
Word32[40] phase dispersion impulse response (MR475 - MR67) 
Word32[40] phase dispersion impulse response (MR475 - MR67) 
Word32[80] initialization table for the MA predictor in DTX 
Word32[1 0] LSF means (not in MR1 22) 
Word32[1 0] LSF means (MR1 22) 
Word32[1 0] LSF prediction factors (not in MR1 22) 

Word32[3*256]] l" LSF quantizer (not in MR122 and MR795) 

Word32[3*512] 2"" LSF quantizer (not in MR122) 

Word32[4*512] 3'" LSF quantizer (not in MR122, MR515 and MR475) 

Word32[4*128] 3'" LSF quantizer (MR515 and MR475) 

Word32[3*51 2] 1 " LSF quantizer (MR795) 

Word32[4*1 28] 1 " LSF quantizer (MR1 22) 

Word32[4*256] 2"" LSF quantizer (MR122) 

Word32[4*256] 3'" LSF quantizer (MR122) 

Word32[4*256] 4* LSF quantizer {MR122) 

Word32[4*64] 5* LSF quantizer (MR1 22) 

Word32[4*256] gain quantization table (MR475) 

Word32[128*4] gain quantization table (MR67, MR74 and MR102) 

Word32[64*4] gain quantization table (MR51 5 and MR59) 
Word32[61 ] interpolation filter coefficients 
Word32[240] LP analysis window (not in MR1 22) 
UWord8[9] comparison optimisation table in DTX 
Uword8[9] comparison optimisation table in DTX 
Uword8[9] comparison optimisation table in DTX 
Uword8[9] comparison optimisation table in DTX 
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4.5.3 Static variables used in tine C code 

In this section, two tables that specify the static variables for the speech encoder and decoder, respectively, are shown. 
All static variables are declared within a C struct. 

Table 7: Speech encoder static variables 



struct name 


Variable 


Type[Length] 


Description 


Speech_Encode_ 


cod_amr_state 


cod_amrState 


see below in this table 


FrameState 










pre state 


Pre ProcessState 


see below in this table 




dtx 


Word32 


Is set if DTX functionality is used 


Pre_ProcessState 


y2 


Float32 


filter state 




yi 


Word16Float32 


filter state 




xO 


Float32 


filter state 




x1 


Float32 


filter state 


cod_amrState 


old_speech 


Float32 [320] 


speech buffer 




speech 


Float32* 


pointer to current frame in old_speech 




p_window 


Float32* 


pointer to LPC analysis window in old_speech 




p_window_12k2 


Float32* 


pointer to LPC analysis window with no lookahead in 
old_speech(MR122) 




new_speech 


Float32* 


pointer to the last 1 60 speech samples in old_speech 




old_wsp 


Float32 [303] 


buffer holding spectral weighted speech 




wsp 


Float32* 


pointer to the current frame in old_wsp 




oldjags 


Word32[5] 


open loop LTP states 




oLgainJIg 


Float32 [2] 


enables open loop pitch lag weighting (MR102) 




old_exc 


Float32[314] 


excitation vector 




exc 


Float32* 


current excitation 




ai_zero 


Float32[51] 


history of weighted synth. filter followed by zero vector 




zero 


Float32* 


zero vector 




h1 


Float32* 


impulse response of weighted synthesis filter 




hvec 


Float32 [80] 


zero vector followed by impulse response 




IpcSt 


IpcState 


see below in this table 




IspSt 


IspState 


see below in this table 




cILtpSt 


cILtpState 


see below in this table 




gainQuantSt 


gainQuantState 


see below in this table 




pitchOLWghtSt 


pitchOLWghtState 


see below in this table 




tonStabSt 


tonStabState 


see below in this table 




vadSt 


vadState 


see below in this table 




vadSt2 


vadState2 


see below in this table 




dtx 


Word32 


is set if DTX functionality is used 




dtx_encSt 


dtx encState 


see below in this table 




mem_syn 


Float32[10] 


synthesis filter memory 




mem_wO 


Float32[10] 


weighting filter memory (applied to error signal) 




mem_w 


Float32[10] 


weighting filter memory (applied to input signal) 




mem_err 


Float32 [50] 


filter memory for production of error vector 




error 


Float32* 


error signal (input minus synthesized speech) 




sharp 


Float32 


pitch sharpening gain 


vadState 


bckr_est 


Float32 [9] 


background noise estimate 




avejevel 


Float32 [9] 


averaged input components for stationary estimation 




oldjevel 


Float32 [9] 


input levels of the previous frame 




subjevel 


Float32 [9] 


input levels calculated at the end of a frame (lookahead) 




a_data5 


Float32 [6] 


memory for the filter bank 




a_data3 


Float32 [5] 


memory for the filter bank 




burst_count 


Word16 


counts length of a speech burst 




hang_count 


Word16 


hangover counter 




stat_count 


Word16 


stationary counter 




vadreg 


Word32 


15 flags for intermediate VAD decisions 




pitch 


Word32 


15 flags for pitch detection 




tone 


Word16 


1 5 flags for tone detection 




complex_high 


Word16 


flags for complex detection 




complexjow 


Word16 


flags for complex detection 




oldlag_count 


Word32 


variables for pitch detection 




oldlag 


Word32 


variables for pitch detection 




complex_hang_count 


Word16 


complex hangover counter, used by VAD 




complex_hang_timer 


Word16 


hangover initiator, used by CAD 
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Struct name 


Variable 


Type[Length] 


Description 




best_corr_hp 
speech_vad_decision 
complex_warning 
sp_burst_count 
corr hp fast 


Float32 
Word16 
Word16 
Word16 
Word16 


filtered value 

final decision 

complex background warning 

counts length of a speech burst inci HO addition 

filtered value 


dtx_encState 


lsp_hist 
log_en_hist 
hist_ptr 
log_en_index 

init_lsf_vq_index 
Ispjndex 
dtxHangoverCount 
decAnaElapsedCount 


Float32[80] 
Float32 [8] 
Word16 
Word16 

Word32 
Word16[3] 
Word 16 
Wordie 


LSP history (8 frames) 
logarithmic frame energy history (8 frames) 
pointer to the cyclic history vectors 
Index for logarithmic energy 

initial index for Isf predictor 
Isp indecies to the three code books 
is decreased in DTX hangover period 
counter for elapsed speech frames in DTX 


IpcState 


LevinsonSt 


LevinsonState 


see below 


LevinsonState 


old A 


Float32[11] 


last frames direct form coefficients 


IspState 


lsp_old 
Isp old q 
qSt 


Float32[10] 
Float32[10] 
Q pIsfState 


old LSP vector 

old quantized LSP vector 

see below in this table 


Q_plsfState 


past_rq 


Float32[10] 


past quantized LSF prediction error 


cILtpState 


pitchSt 


Pitch frState 


see below in this table 


tonStabState 


count 

gp 


Wordie 
Float32[7] 


count consecutive (potential) resonance frames 
pitch gain history 


Pitch frState 


TO prev subframe 


Word32 


integer, pitch lag of previous subframe 


gainQuantState 


sfO_ gcodeO 

sfO_ target_en 

sfO_ coeff 

gain_idx_ptr 
gc_predSt 
gc_predUncSt 
adaptSt 


Float32 

Float32 

Float32 [5] 

Wordie* 
gc_predState 
gc_predState 
GainAdaptState 


subframe 0/2 codebook gain 

subframe 0/2 target energy 

subframe 0/2 energy coefficient 

pointer to gain index value in parameter frame 
see below in this table 
see below in this table 
see below in this table 


gc_predState 


past qua en 


Float32[4] 


MA predictor memory (20*log10(pred. error)) 


GainAdaptState 


onset 

prev_alpha 
prev_gc 
ltpg_mem 


Wordie 
Float32 
Float32 
Float32 [5] 


onset counter 
previous adaptor output 
previous codebook gain 
pitch gain history 


pitchOLWghtState 


old_TO_med 

ada_w 

wghtjig 


Word32 
Float32 
Wordie 


weighted open loop pitch lag 

weigthing level depeding on open loop pitch gain 

switches lag weighting on and off 



£75/ 



3GPPTS 26.104 version 3.2.1 Release 1999 



15 



ETSI TS 126 104 V 3.2.1 (2001-06) 



Table 8: Speech decoder static variables 



struct name 


Variable 


Type[Length] 


Description 


Speech_Decode_FrameSt 
ate 


decoder_amrState 


Decoder_amrState 


see below in this table 


post_state 


Post_FilterState 


see below in this table 




postHP_state 


Post ProcessState 


see below in this table 


Decoder_amrState 


old_exc 


Word32[194] 


excitation vector 




exc 


Word32* 


current excitation 




lsp_old 


Word32[10] 


LSP vector of previous frame 




mem_syn 


Word32[10] 


synthesis filter memory 




sharp 


Word32 


pitch sharpening gain 




old_TO 


Word32 


pitch sharpening lag 




prev_bf 


Word 16 


previous value of "bad frame" flag 




prev_pdf 


Word 16 


previous value of "pot. dangerous frame" flag 




state 


Word 16 


ECU state (0..6) 




excEnergyHist 


Word32[9] 


excitation energy history 




TO_lagBuff 


Word32 


received pitch lag for ECU 




inBackgroundNoise 


Word32 


background noise flag 




voicedHangover 


Word32 


hangover flag 




ItpGainHistory 


Word32[9] 


pitch gain history 




background_state 


Bgn_scdState 


see below in this table 




Cb_gain_averState 


Gb_gain_averageState 


see below in this table 




lsp_avg_st 


lsp_avg State 


see below in this table 




IsfState 


D_plsfState 


see below in this table 




ec_gain_p_st 


ec_gain_pitchState 


see below in this table 




ec_gain_c_st 


ec_gain_codeState 


see below in this table 




pred_state 


go predState 


see table 7 




nodataSeed 


Word 16 


seed for ON generator 




ph_disp_st 


ph_dispState 


see below in this table 




dtxDecoderState 


dtx decState 


see below in this table 


dtx_decState 


since_last_sid 


Word 16 


number of frames since last SID frame 




true_sid_period_inv 


Word 16 


inverse of true SID update rate 




log_en 


Word32 


logarithmic frame energy 




old_log_en 


Word32 


previous value of log_en 




pn seed rx 


Word32 


random number generator seed 




Isp 


Word32[10] 


LSP vector 




Isp old 


Word32[10] 


previous LSP vector 




IsLhist 


Word32[80] 


LSF vector history (8 frames) 




lsf_hist_ptr 


Word 16 


index to beginning of LSF history 




lsf_hist_mean 


Word32[80] 


mean-removed LSF history (8 frames) 




log_pg_mean 


Word 16 


mean-removed logarithmic prediction gain 




log_en_hist 


Word32[8] 


logarithmic frame energy history 




log_en_hist_ptr 


Wordie 


index to beginning of log, frame energy history 




log_en_adjust 


Word 16 


mode-dependent frame energy adjustment 




dtxHangoverCount 


Word 16 


counts down in hangover period 




decAnaElapsedCount 


Word 16 


counts elapsed speech frames after DTX 




sidjrame 


Word 16 


flags SID frames 




valid_data 


Word 16 


flags SID frames containing valid data 




dtxHangoverAdded 


Word 16 


flags hangover period at end of speech 




dtxGlobalState 


enum DTXStateType 


DTX state flags 




data_updated 


Word 16 


flags ONI updates 


Bgn_scdState 


frameEnergyHist 


Word32[60] 


history of synthesis frame energy 




bgHangover 


Word 16 


number of frames since last speech frame 


Cb_gain_averageState 


cbGainHistory 


Word32[7] 


codebook gain history 




hangVar 


Word 16 


counts length of talkspurt in subframes 




hangCount 


Wordie 


number of subframes since last talkspurt 


lsp_avgState 


lsp_meanSave 


Word32[10] 


averaged LSP vector 


D_plsfState 


past_r_q 


Word32[10] 


past quantized LSF prediction vector 




past Isf q 


Word32[10] 


past dequantized LSF vector 


ec_gain_pitchState 


pbuf 


Word32[5] 


pitch gain history 




past_gain_pit 


Word32 


previous pitch gain (limited to 1 .0) 




prev gp 


Word32 


previous good pitch gain 


ec_gain_codeState 


gbuf 


Word32[5] 


codebook gain history 




past_gain_code 


Word32 


previous codebook gain 




prev_gc 


Word32 


previous good codebook gain 


ph_dispState 


gainMem 


Word32[5] 


pitch gain history 




prevState 


Word32 


previously used impulse response 




prevCbGain 


Word32 


previous codebook gain 




lockFull 


Word 16 


force maximum phase dispersion 




onset 


Word 16 


onset counter 


Post_FilterState 


res2 


Word32[40] 


LP residual 




mem_syn_pst 


Word32[10] 


synthesis filter memory 




synth_buf 


Word16[170] 


synthesis filter work area 




agc_state 


agcState 


see below in this table 




preemph_state 


preemphasisState 


see below in this table 
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Struct name 


Variable 


Type[Length] 


Description 


agcState 


past_gain 


Word 16 


past age gain 


preemphasisState 


mem pre 


Word 16 


filter state 


Post_ProcessState 


y2 hi 
y2 lo 
y1 hi 
y1 lo 
xO 
x1 


Word32 
Word32 
Word32 
Word32 
Word32 
Word32 


filter state, upper word 
filter state, lower word 
filter state, upper word 
filter state, lower word 
filter state 
filter state 



Homing procedure 



The principles of the homing procedures are described in 3GPP TS 06.090 [2]. This specification only includes a 
detailed description of the 8 decoder homing frames. For each AMR codec mode, the corresponding decoder homing 
frame has a fixed set of speech parameters shown in table 9a-9h. The bit allocation within these parameters is identical 
to the corresponding bit allocation of the source encoder output parameters given in 3GPP TS 06.090 [2]. 

In the following tables, the following naming convention is used for the individual parameters. Letters in italics indicate 
numbers. 

LPC_nindex of nth LSF submatrix 

LTP-LAG m adaptive codebook index for subframe m 

LTP-GAIN m adaptive codebook gain index in subframe m 

- FCB-GAIN m fixed codebook gain index in subframe m 

GAIN_VQ m codebook gain VQ index in subframe m (subframe m and m+1 for MR475) 

- POS m_n position index of nth pulse in subframe m 

- POS m_n_k position index of nth and tth pulse in subframe m 

- POS m_n_k_lj position index of nth, Alh, fth, and jth pulse in subframe m 
SIGN m_n_k sign information for nth and k\h pulse in subframe m 

SIGN m_ n_k_ljsign information for nth, kth, fth, andyth pulse in subframe m 

SlGN_m_n_k_POS_m_n sign information for nth and Ath pulse and position index for nth pulse in 
subframe m 
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Table 9a: Parameter values for the decoder homing frame (MR475) 



Parameter 


Value (LSB=bO) 


LPC1 


0x00 F8 


LPC2 


0x009D 


LPC3 


0x0010 


LTP-LAG 1 


0x0066 


POS 1 1 2 


0x0000 


SIGN 1 1 2 


0x0003 


GAIN-VQ 1 


0x0028 


LTP-LAG 2 


0x000 F 


POS 2 1 2 


0x0038 


SIGN 2 1 2 


0x0001 


LTP-LAG 3 


0x000 F 


POS 3 1 2 


0x0031 


SIGN 3 1 2 


0x0002 


GAIN-VQ 3 


0x0008 


LTP-LAG 4 


0x000 F 


POS 4 1 2 


0x0026 


SIGN 4 1 2 


0x0003 



Table 9b: Parameter values for the decoder homing frame (MR515) 



Parameter 


Value (LSB=bO) 


LPC1 


0x00 F8 


LPC2 


0x009D 


LPC3 


0x001 C 


LTP-LAG 1 


0x0066 


POS 1 1 2 


0x0000 


SIGN 1 1 2 


0x0003 


GAIN-VQ 1 


0x0037 


LTP-LAG 2 


0x000 F 


POS 2 1 2 


0x0000 


SIGN 2 1 2 


0x0003 


GAIN-VQ 2 


0x0005 


LTP-LAG 3 


0x000 F 


POS 3 1 2 


0x0037 


SIGN 3 1 2 


0x0003 


GAIN-VQ 3 


0x0037 


LTP-LAG 4 


0x000 F 


POS 4 1 2 


0x0023 


SIGN 4 1 2 


0x0003 


GAIN-VQ 4 


0x001 F 
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Table 9c: Parameter values for the decoder homing frame (MR59) 



Parameter 


Value (LSB=bO) 


LPC1 


0x00 F8 


LPC2 


OxOOES 


LPC3 


0x002F 


LTP-LAG 1 


OxOOBD 


POS 1 1 2 


0x0000 


SIGN 1 1 2 


0x0003 


GAIN-VQ 1 


0x0037 


LTP-LAG 2 


0x000 F 


POS 2 1 2 


0x0001 


SIGN 2 1 2 


0x0003 


GAIN-VQ 2 


OxOOOF 


LTP-LAG 3 


0x0060 


POS 3 1 2 


0x00 F9 


SIGN 3 1 2 


0x0003 


GAIN-VQ 3 


0x0037 


LTP-LAG 4 


OxOOOF 


POS 4 1 2 


0x0000 


SIGN 4 1 2 


0x0003 


GAIN-VQ 4 


0x0037 



Table 9d: Parameter values for the decoder homing frame (MR67) 



Parameter 


Value (LSB=bO) 


LPG1 


0x00 F8 


LPG2 


0x00E3 


LPC3 


0x002F 


LTP-LAG 1 


OxOOBD 


POS 112 3 


0x0002 


SIGN 112 3 


0x0007 


GAIN-VQ 1 


0x0000 


LTP-LAG 2 


OxOOOF 


POS 2 1 2 3 


0x0098 


SIGN 2 12 3 


0x0007 


GAIN-VQ 2 


0x0061 


LTP-LAG 3 


0x0060 


POS 3 1 2 3 


0x0505 


SIGN 3 12 3 


0x0007 


GAIN-VQ 3 


0x0000 


LTP-LAG 4 


OxOOOF 


POS 4 1 2 3 


0x0318 


SIGN 4 12 3 


0x0007 


GAIN-VQ 4 


0x0000 
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Table 9e: Parameter values for the decoder homing frame (MR74) 



Parameter 


Value (LSB=bO) 


LPC1 


0x00 F8 


LPC2 


0x00E3 


LPC3 


0x002F 


LTP-LAG 1 


OxOOBD 


POS 112 3 4 


0x0006 


SIGN 112 3 4 


OxOOOF 


GAIN-VQ 1 


0x0000 


LTP-LAG 2 


0x001 B 


POS 2 12 3 4 


0x0208 


SIGN 2 12 3 4 


OxOOOF 


GAIN-VQ 2 


0x0062 


LTP-LAG 3 


0x0060 


POS 3 12 3 4 


0x1 BA6 


SIGN 3 12 3 4 


OxOOOF 


GAIN-VQ 3 


0x0000 


LTP-LAG 4 


0x001 B 


POS 4 12 3 4 


0x0006 


SIGN 4 12 3 4 


OxOOOF 


GAIN-VQ 4 


0x0000 



Table 9f : Parameter values for the decoder homing frame (MR795) 



Parameter 


Value (LSB=bO) 


LPC1 


0x0002 


LPC2 


0x00E3 


LPG3 


0x002F 


LTP-LAG 1 


OxOOBD 


POS 112 3 4 


0x0006 


SIGN 112 3 4 


OxOOOF 


LTP-GAIN 1 


OxOOOA 


FOB-GAIN 1 


0x0000 


LTP-LAG 2 


0x0039 


POS 2 12 3 4 


0x1008 


SIGN 2 12 3 4 


0x0007 


LTP-GAIN 2 


OxOOOA 


FOB-GAIN 2 


OxOOOB 


LTP-LAG 3 


0x0063 


POS 3 12 3 4 


0x1 1A6 


SIGN 3 12 3 4 


OxOOOF 


LTP-GAIN 3 


0x0001 


FOB-GAIN 3 


0x0000 


LTP-LAG 4 


0x0039 


POS 4 12 3 4 


0x09A0 


SIGN 4 12 3 4 


OxOOOF 


LTP-GAIN 4 


0x0002 


FOB-GAIN 4 


0x0001 
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Table 9g: Parameter values for the decoder homing frame (MR102) 



Parameter 


Value (LSB=bO) 


LPC1 


0x00 F8 


LPC2 


0x00E3 


LPC3 


0x002F 


LTP-LAG 1 


0x0045 


SIGN 1 1 5 


0x0000 


SIGN 1 2 6 


0x0000 


SIGN 1 3 7 


0x0000 


SIGN 1 4 8 


0x0000 


POS 112 5 


0x0000 


POS 1 3 6 7 


0x0000 


POS 1 4 8 


0x0000 


GAIN-VQ 1 


0x0000 


LTP-LAG 2 


0x001 B 


SIGN 2 1 5 


0x0000 


SIGN 2 2 6 


0x0001 


SIGN 2 3 7 


0x0000 


SIGN 2 4 8 


0x0001 


POS 2 12 5 


0x0326 


POS 2 3 6 7 


OxOOCE 


POS 2 4 8 


0x007E 


GAIN-VQ 2 


0x0051 


LTP-LAG 3 


0x0062 


SIGN 3 1 5 


0x0000 


SIGN 3 2 6 


0x0000 


SIGN 3 3 7 


0x0000 


SIGN 3 4 8 


0x0000 


POS 3 12 5 


0x01 5A 


POS 3 3 6 7 


0x0359 


POS 3 4 8 


0x0076 


GAIN-VO 3 


0x0000 


LTP-LAG 4 


0x001 B 


SIGN 4 1 5 


0x0000 


SIGN 4 2 6 


0x0000 


SIGN 4 3 7 


0x0000 


SIGN 4 4 8 


0x0000 


POS 4 12 5 


0x01 7C 


POS 4 3 6 7 


0x0215 


POS 4 4 8 


0x0038 


GAIN-VO 4 


0x0030 
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Table 9h: Parameter values for the decoder homing frame (MR122) 



Parameter 


Value (LSB=bO) 


LPC1 


0x0004 


LPC2 


0x002A 


LPC3 


OxOODB 


LPC4 


0x0096 


LPC5 


0x002A 


LTP-LAG 1 


0x0156 


LTP-GAIN 1 


OxOOOB 


SIGN 1 1 6 POS 1 1 


0x0000 


SIGN 1 2 7 POS 1 2 


0x0000 


SIGN 1 3 8 POS 1 3 


0x0000 


SIGN 1 4 9 POS 1 4 


0x0000 


SIGN 1 5 10 POS 1 5 


0x0000 


P0S1 6 


0x0000 


P0S1 7 


0x0000 


P0S1 8 


0x0000 


P0S1 9 


0x0000 


P0S1 10 


0x0000 


FCB-GAIN 1 


0x0000 


LTP-LAG 2 


0x0036 


LTP-GAIN 2 


OxOOOB 


SIGN 2 1 6 POS 2 1 


0x0000 


SIGN 2 2 7 POS 2 2 


OxOOOF 


SIGN 2 3 8 POS 2 3 


OxOOOE 


SIGN 2 4 9 POS 2 4 


OxOOOC 


SIGN 2 5 10 POS 2 5 


0x000 D 


POS 2 6 


0x0000 


POS 2 7 


0x0001 


POS 2 8 


0x0005 


POS 2 9 


0x0007 


POS 2 10 


0x0001 


FCB-GAIN 2 


0x0008 


LTP-LAG 3 


0x0024 


LTP-GAIN 3 


0x0000 


SIGN 3 1 6 POS 3 1 


0x0001 


SIGN 3 2 7 POS 3 2 


0x0000 


SIGN 3 3 8 POS 3 3 


0x0005 


SIGN 3 4 9 POS 3 4 


0x0006 


SIGN 3 5 10 POS 3 5 


0x0001 


POS 3 6 


0x0002 


POS 3 7 


0x0004 


POS 3 8 


0x0007 


POS 3 9 


0x0004 


POS 3 10 


0x0002 


FCB-GAIN 3 


0x0003 


LTP-LAG 4 


0x0036 


LTP-GAIN 4 


OxOOOB 


SIGN 4 1 6 POS 4 1 


0x0000 


SIGN 4 2 7 POS 4 2 


0x0002 


SIGN 4 3 8 POS 4 3 


0x0004 


SIGN 4 4 9 POS 4 4 


0x0000 


SIGN 4 5 10 POS 4 5 


0x0003 


POS 4 6 


0x0006 


POS 4 7 


0x0001 


POS 4 8 


0x0007 


POS 4 9 


0x0006 


POS 4 10 


0x0005 


FCB-GAIN 4 


0x0000 
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6 File formats 

This section describes the file formats used by the encoder and decoder programs. The test sequences defined in [2] also 
use the file formats described here. 

6.1 Speech file (encoder input / decoder output) 

Speech files read by the encoder and written by the decoder consist of 16-bit words where each word contains a 13-bit, 
left aligned speech sample. The byte order depends on the host architecture (e.g. MSByte first on SUN workstations, 
LSByte first on PCs etc.). Both the encoder and the decoder program process complete frames (of 160 samples) only. 

This means that the encoder will only process n frames if the length of the input file is n'*160 + k words, while the 
files produced by the decoder will always have a length of n'*160 words. 

6.2 Mode control file (encoder input) 

The encoder program can optionally read in a mode control file which specifies the encoding mode for each frame of 
speech processed. The file is a text file containing one line per speech frame. Each line contains one of the mode names 
from the list {MR475, MR515, MR59, MR67, MR74, MR795, MR102, MR122}. 

6.3 Parameter bitstream file (encoder output / decoder input) 

The files produced by the speech encoder/expected by the speech decoder contain an arbitrary number of frames in 
AMR Interface Format 2. The format is described in TS 26.101 [8] Annex A. 

By using preprocessor definition encoder/decoder can optionally use format compatible with the existing AMR fixed- 
point C-code. Frame format is following. 

I FRAME_TYPE | B1 | B2 | ... | B244 | MODEJNFO | unusedl \ ... \ unused4 ~| 

Each box corresponds to one Wordl 6 value in the bitstream file, for a total of 250 words or 500 bytes per frame. The 
fields have the following meaning: 

FRAME_TYPE transmit frame type, which is one of 
TX_SPEECH (0x0000) 
TX_SID_FIRST (0x0001) 
TX_SID_UPDATE (0x0002) 
TX_NO_DATA (0x0003) 

B0...B244 speech encoder parameter bits (i.e. the bitstream itself)- Each Bx either has the value 0x0000 

or 0x0001. Only mode MR122 really uses all 244 bits; for the other modes, only the first n 
bits are used (35 < n < 204). The remaining bits are unused (written as 0x0000) 

MODEJNFO encoding mode information, which is one of 
MR475 (0x0000) 
MR515 (0x0001) 
MR59 (0x0002) 
MR67 (0x0003) 
MR74 (0x0004) 
MR795 (0x0005) 
MR102 (0x0006) 
MR122 (0x0007) 

unusedl ...4 unused, written as 0x0000 

As indicated in section 6.1 above, the byte order depends on the host architecture. 
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Annex A (informative): 
Change Request History 
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J^ 
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10 
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3.1.0 AIVIR Core Frame bit ordering (AIVIR speech Codec; Floating 
point C-Code 


12 


SP-010306 003 


1 


F 


R99 


3.1.0 


3.2.0 Limiting predicted codebook gain computing in encoder 


12 


SP-010306 005 


1 


F 


R99 


3.1.0 


3.2.0 


Correction of decoder operation in error concealment of lost 
frames 


12 


SP-010306 007 


1 


F 


R99 3.1 .0 ' 3.2.0 ' Correction of mode state bug in AIVIR decoder 


12 


SP-010306 011 


1 


F 


R99 3.1.0 3.2.0 Correction of decoder Reset 


12 


SP-010306 


013 


1 


F 


R99 


3.1.0 


3.2.0 


Correction of comfort noise parameter interpolation bug of AIVIR 
decoder 


12 


SP-010306 


015 
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F 


R99 


3.1.0 


3.2.0 
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